# The Ubuntu training corpus From [[Main Library - Chatterbot]] The Ubuntu training corpus is an enormous (3gb) collection of conversational text from Ubuntus tech-support system. It's heavily biased and hopelessly garbage but should allow for a decent starting point. ## Locations ## Structure ## Usage ## Outputs ## Benchmarks ## Notes See [[Why I didn't use Chatterbot]]